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(54) Decoding instructions 

(57) A decode unit (20) decodes instructions in a 
processor. These instructions include instructions of a 
first length in a first instruction mode and instructions of 
a second, shorter length in a second instruction mode. 
The decode unit has decoding circuitry (50-60) which 
decode the instructions. A register holds the instruction 
mode and generates an instruction mode signal. Switch- 
ing circuitry (MUX6,MUX7) is responsive to the instruc- 
tion mode signal to output decoded instructions from the 



decode unit depending on the instruction mode. A de- 
tector (70) is provided for detecting a length change in- 
struction of the second, shorter length while in the sec- 
ond instruction mode which indicates that the subse- 
quent instruction is of the first length. The detector also 
temporarily alters the state of the instruction mode sig- 
nal to allow the first length instructions to be decoded 
without changing the instruction mode held in the regis- 
ter. 
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Description 

[0001] The present invention relates to decoding in- 
structions in a computer system such as a processor. 
[0002] In a computer system, instructions are typically 5 
fetched from a program memory, decoded and supplied 
to an execution unit where they are executed to run the 
program stored in the program memory. It is advanta- 
geous for such a computer system to be able to support 
more than one instruction mode. A novel computer sys- 10 
tern described herein can support three instruction 
modes. 

[0003] According to a first instruction mode, during 
each machine cycle two 32 bit instructions are decoded, 
referred to herein as GP32 mode. is 
[0004] According to a second instruction mode, dur- 
ing each machine cycle a pair of 16 bit instructions are 
decoded, referred to herein as GP16 mode. 
[0005] According to a third instruction mode, four 32 
bit instructions are decoded during each machine cycle, 20 
referred to herein as VLIW mode. 
[0006] In practice, a prefetch unit fetches a word from 
memory having a length of 128 bits. This word can con- 
tain eight 16 bit instructions (GP16 mode), four inde- 
pendent 32 bit instructions (GP32) or four interrelated 25 
32 bit instructions (VLIW mode). The four 32 bit instruc- 
tions in VLIW mode are interrelated in the sense that 
they have to conform to a certain grammar such that 
they can be fetched and supplied to the decoder togeth- 
er. The prefetch unit supplies an 1 28 bit sequence to the 30 
decode unit on each machine cycle. However, the de- 
code units should supply to the execution unit decoded 
outputs only for the instructions to be decoded in that 
machine cycle, which depends on the instruction mode. 
[0007] In the second instruction mode, the nature of 35 
the 16 bit instructions can sometimes limit the manipu- 
lations to be carried out by the execution unit. It is there- 
fore advantageous to allow a 32 bit instruction to be in- 
cluded in a sequence of instructions in GP16 mode. If 
this is done however then it would be necessary to in- 40 
elude an instruction to alter the instruction mode of the 
computer system each time prior to execution of a 32 
bit instruction and then to have an additional instruction 
to change the mode back again to continue to decode 
and execute 16 bit instructions. 45 
[0008] According to the present invention in one as- 
pect there is provided a decode unit for decoding in- 
structions in a processor including instructions of a first 
length in a first instruction mode and instructions of a 
second, shorter length in a second instruction mode, the so 
decode unit including: decoding circuitry for decoding 
instructions; a register for holding the instruction mode 
and generating an instruction mode signal; switching cir- 
cuitry responsive to the instruction mode signal to output 
decoded instructions from the decode unit depending ss 
on the instruction mode; and means for detecting a 
length change instruction of the second, shorter length 
while in the second instruction mode which indicates 



that the subsequent instruction is of the first length and 
for temporarily altering the state of the instruction mode 
signal to allow the first length instructions to be decoded 
without changing the instruction mode held in the regis- 
ter. 

[0009] In another aspect the present invention pro- 
vides a processor comprising: at least one execution 
unit for executing instructions; an instruction mode indi- 
cator which indicates one of a plurality of instruction 
modes for the processor; a decode unit for decoding in- 
structions prior to dispatch to the at least one execution 
unit; and an instruction supply mechanism for supplying 
instructions to the decode unit, wherein said instructions 
are represented by bit sequences the length of which 
depends on an instruction mode of the processor, and 
wherein the decode unit comprises: decoding circuitry 
for decoding the instruction; switching circuitry respon- 
sive to an instruction mode signal generated by the in- 
struction mode indicator to output decoded instructions 
from the decode unit depending on the instruction mode; 
means for detecting a length change instruction of a sec- 
ond length while in a second instruction mode which in- 
dicates that the subsequent instruction is of a first length 
and for temporarily altering the state of the instruction 
mode signal to allow the first length instruction to be de- 
coded without changing the instruction mode held at the 
instruction mode indicator. 

[0010] In a further aspect, the invention provides a 
method of decoding instructions in a processor, the in- 
structions each being of a predetermined length and in- 
cluding a length change instruction which indicates that 
a subsequent instruction is of a different length, the 
method comprising: detecting the length change in- 
struction in a decode unit; temporarily altering the out- 
puts of the decode unit to permit the different length in- 
struction to be decoded; and after the different length 
instruction has been decoded, defaulting to decoding of 
the instructions of the predetermined length. 
[0011] The length change instruction is named herein 
as Gp32nxt. It thus allows a 32 bit instruction to be de- 
coded and executed while the machine remains in GP1 6 
mode. This gives GP16 mode a greater flexibility than it 
would have if all instructions were restricted to the 16 
bit length. Moreover, it overcomes the need to change 
the mode of the computer system just to execute a sin- 
gle 32 bit instruction in an instruction sequence of 16 bit 
instructions. 

[001 2] It will be appreciated that the length change in- 
struction and the following 32 bit instruction needs to be 
atomic, i.e. executable without any interrupts. This is be- 
cause once the decoder has detected the length change 
instruction, the next output from the decoder will be as 
though a 32 bit instruction has been decoded. 
[0013] The decode circuitry comprises in the pre- 
ferred embodiment a first decoder having an input con- 
nected to receive a bit sequence of the first length and 
operating on receipt of said bit sequence to generate a 
first decoded output; a second decoder having an input 
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connected to receive a bit sequence of the second 
length and operating on receipt of said bit sequence to 
generate a second decoded output; and a communica- 
tion path for supplying a bit sequence simultaneously to 
said first and second decoders. 

[0014] Thus, as described in the following, in order to 
manage different instruction modes, the decode unit has 
a plurality of dedicated decoders each of which receives 
and decodes the bit sequence during each machine cy- 
cle. Depending on the instruction mode of the machine, 
the outputs of selected one of the decoders are supplied 
to the execution unit for execution. The outputs of the 
other decoder are not required and thus are not select- 
ed. 

[0015] For a better understanding of the present in- 
vention and to show how the same may be carried into 
effect reference will now be made by way of example to 
the accompanying drawings in which: - 

Figure 1 is a block diagram of a prefetch unit; 
Figure 2 illustrates the different instruction modes 
of the processor; 

Figure 3 illustrates the organisation of a prefetch 
buffer; 

Figure 4 is a circuit diagram illustrating the key com- 
ponents of the prefetch buffer; and 
Figure 5 is a block diagram of a decode unit. 

[0016] Figure 1 is a block diagram of a prefetch unit 
2 for a processor, the prefetch unit 2 comprising a 
prefetch buffer 4 with associated control bits 6 and con- 
trol circuitry comprising a pref etcher 8 and an aligner 1 0. 
The pref etcher 8 is connected to a program memory 1 2 
and is responsible for initiating memory accesses to the 
program memory 12 using memory access control sig- 
nals 14a, 14b. The address in memory to which a fetch 
is initiated is held in a prefetch program counter 16 in 
the prefetcher 8. Control of the prefetch program coun- 
ter is not discussed herein, but it can be assumed that 
fetches are initiated from memory in accordance with a 
sequence of instructions to be executed by the proces- 
sor. That is, the prefetch program counter may be incre- 
mented each time as a sequence of adjacent instruc- 
tions is fetched, or it may change according to branches, 
traps, interrupts etc. Responsive to a memory fetch ini- 
tiated by the prefetcher, instruction words are supplied 
from the program memory 12 to the prefetch buffer 4 as 
represented by data-in path 18. 

[0017] The aligner 10 controls reading of instructions 
from the prefetch buffer to a decode unit 20 along data- 
out path 22. To do this, the aligner issues and is respon- 
sive to prefetcher align (PFAL)/decoder control signals 
24a,24b. The aligner 10 has an align program counter 
26 which keeps track of how many instructions have 
been dispatched to the decoder 20 in each machine cy- 
cle, and a state machine 28 which generates a read 
pointer RP for controlling the prefetch buffer in a manner 
which is described in more detail hereinafter. 



[0018] Instructions in the program memory 12 can 
have a length of 16 bits or 32 bits. The prefetch buffer 
supports three different instruction modes as described 
with reference to Figure 2 as follows. The instruction 

s mode is held in a process status register (PSR) 3 at the 
decode unit 20 and can be changed. The change of in- 
struction mode can be caused for example by specific 
change mode instructions which set the PSR according- 
ly. Change mode signals chmd1,chmd2 are issued by 

10 the decoder 20 responsive to a change in instruction 
mode. 

[0019] According to a first instruction mode, two in- 
structions each having a length of 32 bits are supplied 
to the decoder from the prefetch buffer in each machine 
is cycle, for example w0,w1 in CYCLE 0. This mode is re- 
ferred to herein as GP32 mode. 

[0020] According to a second instruction mode, a pair 
of 16 bit instructions are supplied during each machine 
cycle to the decoder 20 from the prefetch buffer 4. This 
pair is denoted slolO.slotl in bit sequences w0,w1 etc. 
This is referred to herein as GP1 6 mode. 
[0021] According to a third instruction mode, four in- 
structions w0,w1 I w2,w3 each of 32 bits in length are 
supplied to the decoder in each machine cycle. This is 
referred to herein as VLIW. In VLIW mode the instruc- 
tions in each instruction word (128 bits) conform to cer- 
tain grammatical rules. 

[0022] In all modes, each fetch operation initiated to 
the program memory 12 retrieves an instruction word of 
128 bits in length. Thus, in GP16 mode, the instruction 
word comprises eight 16 bit instructions, paired as 
slotO.slotl for each machine cycle. In GP32 and VLIW 
mode, the instruction word comprises four 32 bit instruc- 
tions. 

[0023] The organisation of the prefetch buffer 4 is il- 
lustrated in Figure 3. In diagrammatic terms, the 
prefetch buffer can be considered to have four succes- 
sive lines L0 to L3, each having a capacity of 128 bits. 
There is a single write port WPO having a width of 1 28 
bits which receives data from the program memory via 
the data-in path 18 and an input latch FF-in and writes 
it into the selected line under the control of a write point- 
er WP [3:0]. Each line comprises four storage locations 
each having a capacity of 32 bits and each of which is 
shown diagrammatically divided into two 16 bit sections 
for the purposes of explanation. The storage locations 
are denoted F0 to F15. Each line in Figure 3 is referred 
to herein as a group of storage locations and has the 
capacity for one 128 bit line from memory. This allows 
up to four successive memory accesses to be made, 
even if the first instruction word has not been received 
or executed by the processor. While the instruction word 
in storage locations F0 to F3 is being decoded and sub- 
sequently executed, memory fetches can continue to be 
implemented into the storage locations F4 to F7, F8 to 
F11 and F12 to F15 until the buffer is full. By the time 
that a memory fetch has been made into the last group 
F1 2 to F1 5, it is most likely that the first group FO to F3 
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will have been completely read out into the decode unit 
and will thus be ready to receive a subsequent instruc- 
tion word from memory. The number of cycles required 
to decode an instruction word in each group varies de- 
pending on the instruction mode of the machine in a 
manner which will be described in more detail in the fol- 
lowing. Nevertheless, a minimum of one cycle is re- 
quired for reading and decoding, and therefore the use 
of the prefetch buffer hides memory latency. 
[0024] In order to save a cycle when the prefetch buff- 
er is empty or flushed after a branch, data can bypass 
the prefetch buffer through a bypass circuitry BS. As de- 
scribed in more detail later, the bypass circuitry is im- 
plemented as a plurality of multiplexors (MUXO to MUX3 
in Figure 4). 

[0025] Figure 4 is a more detailed diagram of the 
prefetch buffer and its associated read circuitry. The 
storage locations FO to F15 are illustrated aligned ver- 
tically for the purposes of explanation. 
[0026] The control bits 6 described above in Figure 1 
include empty flags EF1 to EF4 which indicate when a 
complete 128 bit line of storage locations is empty such 
that a subsequent memory fetch can be initiated. When 
a fetch is instituted from memory, and data has been 
received by the prefetch buffer, the empty flag is cleared 
to indicate that those storage locations are now full. 
[0027] Reading from the prefetch buffer will now be 
described with reference to the schematic diagram of 
Figure 4. The prefetch buffer includes four read ports 
RP1 ,RP2,RP3 and RP4. These read ports each take the 
form of multiplexors each capable of connecting select- 
ed ones of the storage locations F0 to F15 to a 32 bit 
output, pf-buf-outl ,2,3 or 4. However, the read ports are 
not identical. The first read port RP1 has sixteen inputs 
each of which is connected to a respective storage lo- 
cation F0 to F15 and each of which can be connected 
to the output pf-buf-out1 . The second read port RP2 has 
eight inputs which are respectively connected to storage 
locations F1 ,F3,F5,F7,F9 : F11 1 F13,F15 to selectively 
connect the contents of those storage locations to the 
output pf-buf-out2. 

[0028] The third read port RP3 has four inputs con- 
nected to storage locations F2.F6.F10 and F14 for se- 
lectively connecting the contents of those storage loca- 
tions to the output pf-buf-out3. The fourth read port RP4 
also has four inputs which are connected to storage lo- 
cations F3,F7,F11 and F15 for selectively connecting 
the contents of those storage locations to the output pf- 
buf-out4. 

[0029] The read ports RP1 to RP4 are controlled by 
the read pointer RP from the aligner 10 in dependence 
on the instruction mode of the machine and the conse- 
quential number of machine cycles required for decod- 
ing each instruction word. 

[0030] Alternatively, for instructions supplied directly 
from memory along data-in path 18, the control of in- 
structions supplied to the decoder in dependence on the 
instruction mode and machine cycles is additionally con- 



trollable by multiplexors MUX0.MUX1 ,MUX2 and 
MUX3. These receive at their input respective bits of the 
128 bit data-in path 18 to supply a 32 bit sequence to 
each multiplexor in each machine cycle as described in 
5 the following. 

[0031] The selection of which instructions within the 
instruction word are supplied to the decode unit 20 is 
made on dependence on the instruction mode as de- 
scribed in the following. In Figure 3, the symbols wO to 

10 w3 are used on different input lines of the multiplexors 
MUXO to MUX3 to represent different 32 bit sequences, 
as in Figure 4. The definition of each 32 bit sequence 
depends on the instruction mode, but bits of the data-in 
path are always allocated as wO [0:31], w1 [32:63], w2 

is [64:95], w3 [96:127]. The inputs to the multiplexors are 
individually labelled so as to distinguish between them. 
That is, in GP16 mode, on the first decode cycle, cycle 
0, the first sequence wO is supplied to the decoder 20. 
This presents a pair of 16 bit instructions, slotO.slotl .; 

20 (wO) for simultaneous decoding by the decode unit 20. ' 
On the next cycle, cycle 1 , the sequence w1 is supplied, 
presenting the next pair of 16 bit instructions slotO.slotl : 
(w1) for decoding. In GP16 mode, the read port RP1 
and the multiplexor MUXO are the only read devices 

2S which are used and the control of the word which is sup- 
plied to the decode unit is made by the multiplexor 
MUXO under the control of signal mux-ctr10, and the 
read pointer RP. If the signal mux-ctr10 selects the read 
port output pf-buf-out1 , the read pointer selects inputs 

30 F0 to F3 over four successive cycles CYCLE0 to 
CYCLE3 to read out successively wO to w4. Once stor- 
age location F3 has been read out, the read port counter 
will reset the read port RP1 so that it reads out from stor- 
age locations F4 to F7 over the next four cycles. If the 

35 buffer is not in use, the first instruction pair wO is read 
out by the multiplexor MUXO. That is in cycle 0, input 
MOO of the multiplexor MUXO is selected. Meanwhile, 
the 128 bit line is loaded into the first location of the 
prefetch buffer and the read pointer points to the next 

40 location to be read out by the decode unit. Therefore on 
cycle 1, the next instruction pair w1 is read out by the 
multiplexor MUXO by selecting pf-buf-out1 
[0032] In GP32 mode, in the first machine cycle the 
first two instructions w0,w1 are presented to the decode 

45 unit 20. In the subsequent cycle, cycle 1, the next two 
instructions w2,w3 are presented to the decode unit. 
This utilises read ports RP1 and RP2 and the multiplex- 
ors MUXO and MUX1 . If the signal mux-ctrlO is set to pf- 
buf-outl, and mux-ctrl1 to pf-buf-out2, then the read 

so pointer RP is set to F0 for RP1 and F1 for RP2 in cycle 
0. In cycle 1, it is changed to F2 and F3 respectively. 
Instructions are then read over the next two cycles from 
the next group of storage locations F4 to F7 by altering 
the setting of the read ports RP1 and RP2 responsive 

ss to the read pointer RP. Alternatively, when read from the 
data-in path 18, in the first cycle, the first input M10 of 
the multiplexor MUX1 is set to read w1 (bits 31 to 63) 
and the first input MOO of the multiplexor MUXO is set to 
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read wO (bits 0 to 31 ). Thus, instructions wO and w1 are 
presented to the decode unit 20 in CYCLE 0. Mean- 
while, the 128 bit line is loaded into the prefetch buffer 
so that in the subsequent cycle, CYCLE 1, w2 and w3 
are read from the buffer by selecting pf-buf-out1 and pf- 
buf-out2. 

[0033] In VLIW mode, four 32 bit instructions WO to 
W3 (slotO to slot3) are supplied simultaneously to the 
decode unit 20 in each machine cycle, e.g. CYCLE 0. 
The multiplexors MUX2 and MUX3 are set according to 
the control signals mux-ctrl2 and mux-ctrl3 respectively 
to allow the instruction words w2 and w3 to be read ei- 
ther from the buffer or from the data-in path 1 8. In other 
respects, the settings of RP1 and RP2, MUXO and 
MUX1 are as in GP32 mode. However, in the subse- 
quent cycle, e.g. CYCLE 1 in VLIW mode, it will be no- 
ticed that the instruction words w2 and w3 which would 
have been remaining in GP32 mode have now been 
read out. Therefore, the read pointer RP can immedi- 
ately move on to the next set of storage locations F4 to 
F7 to read out the subsequent VLIW instruction word 
containing the next four instructions. 
[0034] Data is passed from the multiplexors MUXO to 
MUX3 to respective output flip-flops FF0 to FF3 via a 
set of control gates labelled GC 1 , GC2 and GS0 to GS3. 
The control gates GC1 : GC2 are responsive to change 
mode signals chmdl ,chmd2 respectively which indicate 
to the prefetch unit that there has been a change in the 
• instruction mode in which the machine is operating. The 
control gates GS0 to G S3 are responsive to respective 
stop signals stop[0] to stop[3] to prevent any new data 
from entering the decode unit from that output flip-flop. 
These effectively allow the decode unit to be stalled. In 
a stop condition, the outputs of the flip-flops are recircu- 
lated to the input of its associated control switch to pre- 
vent unnecessary operation of the subsequent decoder. 
[0035] Operation of the prefetch unit responsive to the 
change mode signals chmdl and chmd2 will now be de- 
* scribed. The output flip-flop FF0 is connected to a single 
32 bit decoder and to two 16 bit decoders. When the 
machine is in GP16 mode, the outputs of the two 16 bit 
decoders are selected for the instruction pair supplied 
to the flip-flop FFO. When the machine is in GP32 mode, 
the output of the 32 bit decoder is selected. The remain- 
ing flip-flops FF1 to FF3 are each connected to respec- 
tive 32 bit decoders. 

[0036] A first change mode signal chmdl signals a 
change of machine instruction mode from GP32 to 
GP16. If the machine had been operating in GP32 
mode, consider the situation at the end of cycle 0 which 
reference to Figure 2. Instructions wO and w1 will have 
been supplied via the flip-flops FFO and FF1 to the re- 
spective 32 bit decoders of the decoder 20. However, 
the change in instruction mode now implies that the 32 
bit sequence which was formerly to be considered as 
the second instruction W1 in cycle 0 of GP32 mode, in 
fact contains a pair of 16 bit instructions as denoted in 
cycle 1 of GP16 mode. Thus, the output of the 32 bit 



decoder connected to the flip-flop FF1 needs to be ig- 
nored, and the 32 bit sequence w1 needs to be reapplied 
to the two 16 bit decoders connected to the output flip- 
flop F0. This is achieved by the recirculation line 42 from 
5 the output of the flip-flop FF1 to the input of the control 
gate CG1. 

[0037] Conversely, control signal chmd2 denotes a 
change of instruction mode from GP16 to GP32. Con- 
sider again the effect at the end of cycle 0 with reference 

10 to Figure 2. The instruction pair denoted wO has just 
been decoded in GP16 mode, and the expectation is 
that the machine will now wait for the next instruction 
pair w1. However, in GP32 mode, that word w1 repre- 
sents a single instruction and the change mode signal 

is chmd2 allows it to be applied directly through the control 
gate GC2 to the output flip-flop FF1 so that it can be 
applied directly to the input of the dedicated 32 bit de- 
coder connected to the output of the flip-flop FF1 . This 
allows the instruction w1 to be decoded as a single 32 

20 bit instruction. In the next cycle, instructions w2 and w3 
can be transmitted normally as indicated by cycle 1 in 
GP32 mode in Figure 2. 

[0038] It will be clear from the above that the number 
of cycles needed to read all four storage locations in a 

25 group depends on the instruction mode. Thatis; in GP16 
mode, four cycles are needed, in GP32 two cycles are 
needed and VLIW one cycle is needed. When all the 
storage locations F0 to F3 in the first group have been 
read, the first empty flag EF1 is cleared to empty. 

30 [0039] The aligner controls the setting and clearing of 
the "empty" flags using information from the read point- 
er. The aligner detects when the read pointer goes from 
one line (128 bits) to the next. When this occurs, the 
■empty 1 flag corresponding to the page which has just 

35 been read is set. 

[0040] The state of an empty flag being cleared is de- 
tected by the prefetcher 8 along line 48 and a fetch is 
initiated to the next prefetch address in the prefetch pro- 
gram counter 16. Thus, the next instruction line is 

40 fetched from memory and the write pointer WP is set to 
write it into storage locations F0 to F3. In the meantime, 
the read pointer has moved to the second group F4 to 
F7 to read and decode instructions of that group. When 
those storage locations are empty, the empty flag EF2 

45 is cleared, a next memory fetch is initiated by the 
prefetcher 8 and the read pointer moves onto the group 
F8 to F11. As can readily be seen, the prefetch buffers 
masks a latency of memory fetches of at least three cy- 
cles in the VLIW mode, and a greater number of cycles 

so jn GP32 and GP16 mode. Signals are supplied from the 
decoder along line 24b to the aligner 10 indicating what 
mode the decoder is operating in so that the aligner can 
adjust the align program counter 26 accordingly and 
keep track of the next instructions to be decoded so that 

55 the read pointer RP can correctly be issued by the state 
machine 28. The control signals 24b sent from the de- 
code unit 20 to the aligner include an acknowledge bit 
which controls the aligner, in particular the program 
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counter 26. 

[0041] Figure 5 illustrates the details of the decode 
unit 20. The decode unit comprises six decoders 
50,52,54,56,58 and 60. Four of the decoders 50,54,58 
and 60 are 32 bit decoders which are labelled 
DEC32-0.1 ,2 and 3 respectively because they are each 
associated with the output devices FF0 to FF3 of the 
prefetch buffer illustrated in Figure 4. There are two 16 
bit decoders 52,56 which are labelled DEC1 6-0,1 re- 
spectively. Both of the 16 bit decoders 52,56 are asso- 
ciated with the first output device FFO. The 32 bit decod- 
ers 50,54,58 and 60 are hard-wired by 32 bit communi- 
cation paths respectively to the output devices F0 to F3. 
These communication paths are denoted by the dotted 
lines CP0 to CP3 in Figure 6. The first 16 bit decoder 52 
is hard-wired to the 16 least significant bits of the output 
of the output device FFO, and the second 16 bit decoder 
56 is hard-wired to the 16 most significant bits of the 
output of the output device FFO. Thus, the bits of each 
instruction are supplied as a parallel sequence to the 
decoders. 

[0042] The outputs of the top two decoders 50 and 56 
are supplied to a first decode multiplexor MUX6 which 
has an output connected to an instruction dispatch unit 
62. The outputs of the second two decoders 54,56 are 
connected to a second decode multiplexor MUX7, the 
output of which is also connected to a dispatch unit 62. 
The outputs of the lower most decoders 58,60 are con- 
nected directly to the dispatch unit 62. The decoders 50 
to 60 are always "on". That is, on each machine cycle 
they receive the bits on the connection paths CP input 
to the decoders, perform a decode operation and supply 
an output. It can readily be seen therefore that of the top 
four decoders, the outputs of only two of them in each 
case are of interest. For example, in GP32 mode, it is 
the outputs of the decoders 50,54 which are required - 
the outputs of the decoders 52,56 being redundant. A 
mode signal MODE supplied to the multiplexors 
MUX6.MUX7 selects the appropriate outputs in depend- 
ence on the instruction mode of the machine. 
[0043] Although the decoders are normally "on", they 
are responsive only to change in state of the bits sup- 
plied to them from the output devices FFO to FF3. If be- 
tween two machine cycles, there is no change in state 
in the outputs of the output devices, then the connected 
decoder will not need to change any of its internal state. 
Thus, no power will be consumed by that decoder for as 
long as the outputs of the output device FFO to FF3 con- 
nected to is do not change. Therefore, by recirculating 
the outputs using the control switches GS0 to GS3 re- 
sponsive to the stop signals stop[0] to stop[3] when the 
relevant decoders are not required, a power saving fea- 
ture is introduced. 

[0044] It is sometimes useful to be able to use a 32 
bit instruction in GP16 mode because the instructions 
in GP16 mode are more limited, while those in GP32 
mode allow more complicated operations. Therefore the 
present system in GP16 mode allows an instruction se- 



quence: 

(i) Gp32nxt 

(ii) Instr (32 b) 

5 

where (i) is the name of a specific instruction and (ii) 
represents any 32 bit instruction. Instruction (i) has the 
effect of allowing execution of eight of the 32 bit instruc- 
tions while remaining in GP16 mode. The instruction (i) 
10 is detected in the decoder and causes the following 
changes in operation. 

[0045] The decoder recognises a Gp32nxt instruction 
in the detector 70 and causes a MODE signal to switch 
MUX6 to supply the output of the 32 bit decoder 50 on 

15 the next cycle. The decoder acknowledges the subse- 
quent 32 bit instruction to the aligner as two 16 bit in- 
structions. On the subsequent cycle, the MODE signal 
to MUX6 reverts back to select the output of the 16 bit 
decoder 52 for decoding of subsequent 16 bit instruc- ( 

20 tjons. There is no change in the process status register r 
PSR of the machine . The Gp32nxt instruction avoids the 
need for wasteful execution of change mode instruc- : 
tions just to execute a single 32 bit instruction while in 
GP16 mode. It will be appreciated that the sequences 

25 (j), (jj) above must however be atomic. 



Claims 

30 1 . A decode unit for decoding instructions in a proces- 
sor including instructions of a first length in a first 
instruction mode and instructions of a second, 
shorter length in a second instruction mode, the de- 
code unit including: 

35 

decoding circuitry for decoding instructions; 
a register for holding the instruction mode and 
generating an instruction mode signal; 
switching circuitry responsive to the instruction 
40 mode signal to output decoded instructions 

from the decode unit depending on the instruc- 
tion mode; and 

means for detecting a length change instruction 
of the second : shorter length while in the sec- 

45 ond instruction mode which indicates that the 

subsequent instruction is of the first length and 
for temporarily altering the state of the instruc- 
tion mode signal to allow the first length instruc- 
tions to be decoded without changing the in- 

50 struction mode held in the register. 

2. A decode unit according to claim 1 , wherein the de- 
coding circuitry comprises: 

55 a first decoder having an input connected to re- 

ceive a bit sequence of the first length and op- 
erating on receipt of said bit sequence to gen- 
erate a first decoded output; 
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a second decoder having an input connected 
to receive a bit sequence of the second length 
and operating on receipt of said bit sequence 
to generate a second decoded output; and 
a communication path for supplying a bit se- 
quence simultaneously to said first and second 
decoders. 

3. A decode unit according to claim 2, which compris- 
es a third decoder having an input connected to re- 
ceive a bit sequence of the second length and op- 
erating on receipt of said bit sequence to generate 
a third decoded output, wherein the second length 
is half the first predetermined length. 

4. A decode unit according to claim 3, which compris- 
es an input buffer for holding a bit sequence of the 
first length and simultaneously supplying said bit 
sequence to the first decoder and to each of the sec- 
ond and third decoders. 

5. A decode unit according to any preceding claim, 
wherein in accordance with the first instruction 
mode, two instructions each having a length of 2n 
bits are simultaneously decoded in a machine cy- 
cle. 

6. A decode unit according to any preceding claim, 
wherein according to the second instruction mode, 
each bit sequence has 2n bits and comprises two 
instructions each of bit length n which are decoded 
in a machine cycle. 

7. A decode unit according to claim 5 or 6, wherein n 
equals 16. 

8. A decode unit according to any preceding claim, 
wherein the switching circuitry comprises a multi- 
plexor. 

9. A processor comprising: 

at least one execution unit for executing in- 
structions; 

an instruction mode indicator which indicates 
one of a plurality of instruction modes for the 
processor; 

a decode unit for decoding instructions prior to 
dispatch to the at least one execution unit; and 
an instruction supply mechanism for supplying 
instructions to the decode unit, wherein said in- 
structions are represented by bit sequences the 
length of which depends on an instruction mode 
of the processor, and 
wherein the decode unit comprises: 
decoding circuitry for decoding the instruction; 
switching circuitry responsive to an instruction 
mode signal generated by the instruction mode 



indicator to output decoded instructions from 
the decode unit depending on the instruction 
mode; 

means for detecting a length change instruction 
5 of a second length while in a second instruction 

mode which indicates that the subsequent in- 
struction is of a first length and for temporarily 
altering the state of the instruction mode signal 
to allow the first length instruction to be decod- 
10 ed without changing the instruction mode held 

at the instruction mode indicator 

10. A method of decoding instructions in a processor, 
the instructions each being of a predetermined 
is length and including a length change instruction 
which indicates that a subsequent instruction is of 
a different length, the method comprising: 

detecting the length change instruction in a de- 
code unit; 

temporarily altering the outputs of the decode 
unit to permit the different length instruction to 
be decoded; and 

after the different length instruction has been 
25 decoded, defaulting to decoding of the instruc- 

tions of the predetermined length. 
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